Fix thread-safety bug in geom.c #75
Conversation
This changes the semantics, but it's probably a good idea anyway since
This is potentially dangerous for a large prism because stack space is limited. In particular, it could crash for prisms with more than about 100,000 elements.
For the prism list, in practice the
I implemented my proposal and pushed a commit to this PR.
The latest changes (c617512) are producing failures for the Meep unit test
`intersect_line_segment_with_mesh` added `(b - last_s)` when a hit fell at or past `b`, and then the post-loop cleanup added the same length again because `inside`/`last_s` were not updated in the break branch. Drop the in-branch addition and let the post-loop check handle the inside-at-`b` case.

Add `test_cube_segments_vs_block`: 1000 random unit-direction line segments through a unit-cube mesh vs `make_block`, comparing interior lengths to within 1e-4. This test surfaced the bug; the max diff after the fix is ~4e-16.

Also note that `remove_duplicate_intersections` should be unified with the prism `slist` dedup after NanoComp#75.
Root cause: two thread-safety bugs in `utils/geom.c`, exposed by Meep's recently added OMP parallelization of `set_chi1inv` (NanoComp/meep#3166).

Fix 1 (primary — caused the crash of 
`test_mode_coeffs.py`): Remove the `geom_fix_object_ptr(&o)` call from `geom_get_bounding_box()`. For Prism objects, this call triggered `reinit_prism()`, which frees and re-mallocs all internal prism arrays. When multiple OMP threads call this concurrently on the same Prism, it causes a double-free/use-after-free, corrupting heap metadata and producing the `free(): unaligned chunk detected in tcache 2` error.

Fix 2 (secondary — would cause incorrect results): Change `intersect_line_segment_with_prism()` to use a stack-allocated variable-length array (VLA) (`double slist[num_vertices + 2]`) instead of the shared `prsm->workspace.items` buffer. This prevents data races during concurrent intersection calculations.

Verification: all 6 tests in Meep's `test_mode_coeffs.py` now pass with `OMP_NUM_THREADS=1`, `2`, and `4`.

@Luochenghuang